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(57) DC and AC DCT transform coefficients of an 
INTRA coded block (310) are differentially encoded by 
selecting predictor DC and AC coefficients from a left- 
hand neighboring block (320) and a top neighboring 
block (340). Each block (310, 320, 340) is coded 
according to a frame mode, a reordered field mode, and 
a non-reordered field mode. The AC predictor block is 
selected according to the respective coding modes of 
the blocks, and the block in which a DC predictor 
resides. The top block (340) is selected as an AC pre- 



dictor when the top block (340) and current block (310) 
are both reordered field mode, or both frame mode 
and/or non-reordered field mode, and the DC predictor 
resides in the top block. Zeroed AC spatial transform 
coefficients are used in place of the AC spatial trans- 
form coefficients from the selected block when the 
selected block is not INTRA coded, or does not reside in 
the same Video Object Plane (VOP) as the current 
block. DC coefficients may be non-linearly quantized. 
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Description 

BACKGROUND OF THE INVENTION 

s The present invention relates to a method and apparatus for coding of DC and AC coefficients for intra-macroblock 
digital video, in particular, when the current macroblock and/or a neighboring macroblock is interlaced (e.g., field) 
coded. Predictor coefficients from neighboring macroblocks are selected for use in differentially encoding the coeffi- 
cients of a current macroblock to optimize coding efficiency. The coefficients are obtained from a spatial transformation 
such as the Discrete Cosine Transform (DCT), for example. 

w The invention is particularly suitable for use with various multimedia applications, and is compatible with the MPEG- 
4 Verification Model (VM) 8.0 standard (MPEG-4 VM 8.0) described in document ISO/IEC/JTC1/SC29/WG1 1 N1796, 
entitled "MPEG-4 Video Verification Model Version 8.0", Stockholm, July 1997, incorporated herein by reference. The 
MPEG-2 standard is a precursor to the MPEG-4 standard, and is described in document ISO/EC 13818-2, entitled 
"Information Technology - Generic Coding of Moving Pictures and Associated Audio, Recommendation H.262," March 

15 25, 1994, incorporated herein by reference. 

MPEG-4 is a new coding standard which provides a flexible framework and an open set of coding tools for commu- 
nication, access, and manipulation of digital audio-visual data. These tools support a wide range of features. The flexi- 
ble framework of MPEG-4 supports various combinations of coding tools and their corresponding functionalities for 
applications required by the computer, telecommunication, and entertainment (i.e., TV and film) industries, such as 

20 database browsing, information retrieval, and interactive communications. 

MPEG-4 provides standardized core technologies allowing efficient storage, transmission and manipulation of 
video data in multimedia environments. MPEG-4 achieves efficient compression, object scalability, spatial and temporal 
scalability, and error resilience. 

The MPEG-4 video VM coder/decoder (codec) is a block- and object-based hybrid coder with motion compensa- 

25 tion. Texture is encoded with an 8x8 Discrete Cosine Transformation (DCT) utilizing overlapped block-motion compen- 
sation. Object shapes are represented as alpha maps and encoded using a Content-based Arithmetic Encoding (CAE) 
algorithm or a modified DCT coder, both using temporal prediction. The coder can handle sprites as they are known 
from computer graphics. Other coding methods, such as wavelet and sprite coding, may also be used for special appli- 
cations. 

30 Motion compensated texture coding is a well known approach for video coding, and can be modeled as a three- 
stage process. The first stage is signal processing which includes motion estimation and compensation (ME/MC) and 
a two-dimensional (2-D) spatial transformation. The objective of ME/MC and the spatial transformation is to take advan- 
tage of temporal and spatial correlations in a video sequence to optimize the rate-distortion performance of quantization 
and entropy coding under a complexity constraint. The most common technique for ME/MC has been block matching, 

35 and the most common spatial transformation has been the DCT 

Additionally, in certain situations, coding efficiency may be improved when macroblocks are INTRA coded, e.g., 
without reference to a temporally subsequent or previous predictor macroblock. INTRA coding may be desirable for an 
image sequence with rapid motion, a scene change, or rapid changes in lighting conditions, where there may be little 
frame-to-frame correlation. Coding efficiency may further be improved by differentially encoding the DCT coefficients of 

40 a current block using the coefficients of a neighboring block as predictor coefficients. 

However, with coding schemes such as MPEG-4, it is possible to have both frame mode and field mode macrob- 
locks next to one another in a video object plane (VOP) or other image area. Additionally, field mode macroblock may 
be reordered or non-reordered in an adaptive coding scheme. Accordingly, it would be desirable to have a system for 
selecting predictor coefficients for differentially coding the AC and DC DCT coefficients of a current INTRA coded block, 

45 where the current block and/or the neighboring blocks are coded in a frame mode, non-reordered field mode, and/or 
reordered field mode. 

The system should further provide non-linear quantization of DC transform coefficients. 
The present invention provides a system having the above and other advantages. 

50 SUMMARY OF THE INVENTION 

In accordance with the present invention, a method and apparatus are presented for selecting predictor coefficients 
for differentially coding the coefficients of a current INTRA coded block. 

A method for coding spatial transform coefficients of a current, INTRA coded block in a digital video image includes 
55 the step of identifying an adjacent left-hand block, an adjacent top block, and a top, left block which is adjacent to the 
left-hand block and the top block. The left-hand block, top block, and top, left block each have a DC spatial transform 
coefficient and a plurality of AC spatial transform coefficients. Additionally, each of the blocks is coded in a coding mode 
according to a frame mode, a reordered field mode, or a non-reordered field mode. Predictor AC spatial transform coef- 
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f icients are selected from the left-hand block or the top block for use in differentially encoding the AC spatial transform 
coefficients of the current block. The selecting step considers the coding mode of the current block and the top block 
In particular, the AC spatial transform coefficients may be selected from the left-hand block when the top block is 
coded according to the frame mode or the non-reordered field mode, and the current block is coded according to the 
5 reordered field mode. 

The AC spatial transform coefficients may be selected from the left-hand block when the current block is coded 
according to the frame mode or the non-reordered field mode, and the top block is coded according to the reordered 
field mode. 

The DC spatial transform coefficient for use in differentially encoding the DC spatial transform coefficient of the cur- 
10 rent block may be selected from the left-hand block or the top block according to a gradient between the DC spatial 
transform coefficients of the left block and the top, left block, and a gradient between the DC spatial transform coeffi- 
cients of the top block and the top, left block. 

The AC spatial transform coefficients may be selected from the left-hand block or the top block according to the 
selected DC spatial transform coefficient. 
15 The AC spatial transform coefficients may be selected from the top block when the DC spatial transform coefficient 
is selected from the top block, and the current block is coded according to the frame mode or the non-reordered field 
mode, and the top block is coded according to the frame mode or the non-reordered field mode. 

The AC spatial transform coefficients may be selected from the top block when the DC spatial transform coefficient 
is selected from the top block, and the current block and the top block are both coded according to the reordered field 
20 mode. 

Zeroed AC spatial transform coefficients may be used in place of the AC spatial transform coefficients from the 
selected block when the selected block is not INTRA coded. 

Furthermore, when the current block resides in a current video object plane (VOP), zeroed AC spatial transform 
coefficients may be used in place of the AC spatial transform coefficients from the selected block when the selected 
25 block does not reside in the current VOP. 

The DC coefficients may be non-linearly quantized. 

A corresponding decoding method is also presented. The decoder may identify the predictor coefficients independ- 
ently by performing the selection process used at the encoder, or the decoder may recover a codeword from the trans- 
mitted brtstream which identifies the predictor coefficients. 
30 Corresponding apparatus are also presented. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a block diagram of an encoder in accordance with the present invention. 
35 FIG. 2 illustrates reordering of pixel lines in an adaptive frame/field DCT scheme in accordance with the present 
invention. 

FIG. 3 illustrates DC coefficient prediction for an INTRA coded block in accordance with the present invention. 
FIGs 4(a)-(d) illustrates four possible arrangements of a current block and candidate predictor blocks in accordance 
with the present invention. 

40 FIG. 5 illustrates AC coefficient prediction for an INTRA coded block in accordance with the present invention. 
FIG. 6 is a block diagram of a decoder in accordance with the present invention. 
FIG. 7 illustrates a macroblock packet structure in accordance with the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 

45 

A method and apparatus are presented for selecting predictor coefficients for differentially coding the coefficients 
of a current INTRA coded macroblock. DC and AC coefficients are differentially encoded by selecting predictor DC and 
AC coefficients from a left-hand neighboring block and a top neighboring block. Each block is coded according to a 
frame mode, a reordered field mode, or a non-reordered field mode. The AC predictor block is selected according to the 

so respective coding modes of the blocks, and the block in which the DC predictor resides. The top block is selected as an 
AC predictor when the top block and current block are both reordered field mode, or both frame mode and/or non-reor- 
dered field mode, and the DC predictor resides in the top block. Zeroed AC spatial transform coefficients are used in 
place of the AC spatial transform coefficients from the selected block when the selected block is not INTRA coded, or 
does not reside in the same Video Object Plane (VOP) as the current block. DC coefficients can similarly be set to a 

55 non-zero value in this case. 

FIG. 1 is a block diagram of an encoder in accordance with the present invention. The encoder is suitable for use 
with video object planes (VOPs) such as those defined in the MPEG-4 standard. VOPs are pictorial elements in a 
frame. A VOP can have an arbitrary shape, and a succession of VOPs is known as a video object. A full rectangular 
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video frame may also be considered to be a VOR Thus, the term "VOP" will be used herein to indicate both arbitrary 
and non-arbitrary (e.g., rectangular) image area shapes. Each pixel in a frame is identified as belonging to a particular 
VOR 

A VOP may be classified as a predicted VOP (P-VOP), an intra-coded VOP (l-VOP) or a bi-directionally predicted 
5 VOP (B-VOP). Moreover, a single VOP may include different types of macroblocks. In particular, a VOP may include a 
number of macroblocks (MBs) which are coded individually using an infra-frame mode or an inter-frame mode. With 
intra-frame (INTRA) coding, the MB is coded without reference to a temporally previous or subsequent MB. With inter- 
frame (INTER) coding, the MB may be differentially coded with respect to a temporally subsequent and/or previous ref- 
erence (e.g., anchor) frame. The anchor frame (e.g., VOP) for a P-VOP must be another P-VOP or an l-VOP, not a B- 
10 VOR An l-VOP includes self-contained (e.g., intra-coded) blocks which are not predictive coded. 

Moreover, both intra-frame and inter-frame coded MBs can be coded in a frame mode, a reordered field mode, or 
a non-reordered field mode. 

B-VOPs can use the forward prediction of P-VOPs as well as backward prediction, bi-directional prediction, and 
direct mode, which are all inter-frame techniques. B-VOPs do not currently use intra-frame coded MBs under MPEG-4 

is VM 8.0 referred to previously, although this is subject to change. The anchor frame (e.g., VOP) for a B-VOP must be a 
P-VOP or l-VOP, but not another B-VOP. 

The encoder, shown generally at 100, includes a shape coder 1 10, a motion estimation function 120, a motion com- 
pensation function 1 30, and a texture coder 1 40, which each receive video pixel data input at terminal 1 05. The motion 
estimation function 120, motion compensation function 130, texture coder 140, and shape coder 1 10 also receive VOP 

20 shape information input at terminal 1 07, such as the MPEG-4 parameter VOP_of_arbitrary_shape. When this parame- 
ter is zero, the VOP has a rectangular shape, and the shape coder 110 therefore is not used. 

With INTER coding, a reconstructed anchor VOP function 150 provides a reconstructed anchor VOP for use by the 
motion estimation function 120 and motion compensation function 130. A current VOP is subtracted from a motion com- 
pensated previous VOP at subtracter 160 to provide a residue which is encoded at the texture coder 140. The texture 

25 coder 140 performs the DCT on the residue to provide texture information (e.g., transform coefficients) to a multiplexer 
(MUX) 180. The texture coder 140 also provides information which is summed with the output from the motion compen- 
sator 130 at a summer 170 for input to the previous reconstructed VOP function 150. 

For INTRA coding, the DCT is performed on the DCT coefficients from the pixel data of a current block, not on a 
residue. However, as discussed below, the DCT coefficients themselves may be differentially encoded using AC and DC 

30 coefficient predictors from a selected block which is adjacent to the current block in the same VOP. 

For INTER coded blocks, motion information (e.g., motion vectors) is provided from the motion estimation function 
120 to the MUX 180, while shape information which indicates the shape of the VOP is provided from the shape coding 
function 1 10 to the MUX 180. The MUX 180 provides a corresponding multiplexed data stream to a buffer 190 for sub- 
sequent communication over a data channel. 

35 The pixel data which is input to the encoder may have a YUV 4:2:0 format. The VOP is represented by means of a 
bounding rectangle. The top left coordinate of the bounding rectangle is rounded to the nearest even number not 
greater than the top left coordinates of the tightest rectangle. Accordingly, the top left coordinate of the bounding rec- 
tangle in the chrominance component is one-half that of the luminance component. 

For some video sequences, such as where there is fast motion or a scene change, it may be more efficient (i.e., 

40 use fewer bits) to directly code the pixel intensity values of a current block instead of using inter-VOP differential encod- 
ing. Accordingly, it is desirable to have a decision criteria for adaptively choosing to code the current block directly (e.g., 
INTRA mode) or differentially (e.g., INTER mode). The following parameters are calculated to make the INTRA/INTER 
decision: 

45 AM AM AM AM 

mean = — £ Z ^ A B Z Z l c ft/>™ean|, 
N /.o /-o /-o y=o 



so where /Vis the size of the block (e.g., N=16fora 16x16 macroblock). The term c(i,j) represents the current macroblock 
luminance samples, where V is a horizontal index, and "j" is a vertical index. The INTRA mode is chosen if 

A <(SAD inmr (MV x ,MV y )-2*Nc)\<^ecw$e, the INTER 

55 mode is used. SAD inter is the sum of the absolute differences taken on a pixel-by-pixel basis between the pixels of the 
current macroblock and the pixels of the best match macroblock. The best match macroblock is referenced by the hor- 
izontal and vertical motion vectors, MV X and MVy, respectively. Nc is the number of pixels in the current VOP. A VOP 
typically comprises several macroblocks. Note that the MV in SAD^^MV^MVy) for this decision is at integer pixel res- 
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olulion. If the INTRA mode is chosen, no further operations are necessary for the motion search. If the INTER mode is 
chosen, the motion search continues for a half-pixel resolution MV. 

FIG. 2 illustrates reordering of pixel lines in an adaptive frame/field DOT scheme in accordance with the present 
invention. 

5 When interlaced (i.e., field mode) video is coded, superior energy compaction can sometimes be obtained by reor- 
dering the lines of a macroblock to form 8x8 luminance blocks consisting of data from one field. The DCT is then per- 
formed on the reordered macroblock which comprises same-field pixel lines (i.e., either top of bottom fields). Field DCT 
line order is used when 

10 6 15 6 15 

ZZ (P2iJ-P2U}j) 2 +(P2MJ'P2i+2j) 2 > ZZ ^2//P2/ + 2,y) 2+ (P2/ + 1/P2/ + 3,/ 

/o0/*o /»oy»o 



15 where p- ( j are the spatial luminance data samples for INTRA coding (or differences for INTER coding) just before the 
8x8 DCT is performed. The terms T and T are horizontal and vertical indexes, respectively. The field DCT permutation 
is indicated when the MPEG-4 flag dct_type=1 (i.e., reordered field mode). Non-reordered field mode, and frame mode, 
are indicated by dct_type=0. The decision to reorder lines is made after the decision to use INTRA or INTER coding. 
When field DCT mode is used, the luminance lines for INTRA mode (or luminance error for INTER mode) in the 
20 spatial domain of the macroblock are permuted from the frame DCT orientation to field DCT configuration. The resulting 
macroblock is transformed, quantized and VLC encoded normally. On decoding a field DCT macroblock, the inverse 
permutation is performed after all luminance blocks have been obtained from the IDCT In the 4:2:0 format, chromi- 
nance data are not effected by this mode. 

A field mode image, e.g., a 16x16 macroblock (MB), is shown generally at 20. The MB includes even-numbered 
25 lines 202, 204, 206, 208, 210, 212, 214 and 216, and odd-numbered lines 203, 205, 207, 209, 211, 213, 215 and 217. 
The even and odd lines are thus interleaved, and form top and bottom fields, respectively. 

When the pixel lines in image 200 are permuted to form same-field luminance blocks, the MB shown generally at 
250 is formed. Arrows, shown generally at 245, indicate the reordering of the lines 202-21 7. For example, the even line 
202, which is the first line of MB 200, is also the first line of MB 250. The even line 204 is reordered as the second line 
30 in MB 250: Similarly, the even lines 206, 208, 210, 212, 214 and 216 are reordered as the third through eighth lines, 
respectively, of MB 250. Thus, a 16x8 luminance region 280 with even-numbered lines is formed. Similarly, the odd- 
numbered lines 203, 205, 207, 209, 21 1 , 213, 215 and 21 7 form a 16x8 region 285. 

For texture coding, an 8x8 DCT is performed on the two blocks in the top region 280, and the two blocks in the bot- 
tom region 285. A separable two-dimensional DCT is used. As described below, further coding efficiencies can be 
35 achieved for INTRA blocks by differentially encoding the DCT coefficients using predictor coefficients from blocks which 
are adjacent to the current block. 

FIG. 3 illustrates DC coefficient prediction for an INTRA coded block in accordance with the present invention. DC 
coefficient prediction is not affected by whether a block is frame mode or field mode. Before entropy coding, lossless 
prediction of quantized DC and some AC coefficients of the DCT is performed to concentrate the coefficient distribution 
40 around zero so that entropy coding can be more efficient. The adaptive DC prediction method for a current block 
involves selection of either the quantized DC (QDC) value of a block which is to the left of the current block, or the block 
which is immediately above the current block, i.e., in the previous row. 

Furthermore, quantization of the DC coefficients may be linear or non-linear. For linear quantization, with a quan- 
tizer step-size for the DC coefficient of the luminance and chrominance components of eight, QDC = dc // 8, where "dc" 
45 is an 1 1 -bit unquantized transform coefficient value from the DCT. 7f denotes rounded division. 

Non-linear quantization for DC coefficients of an intra-macroblock is described with reference to Table 1 . 



TABLE 1 



Component:Type 


dc_scaler for Quantizer (Qp) range 




1 through 4 


5 through 8 


9 through 24 


25 through 31 


Luminance 


8 


2Qp 


Qp+8 


2Qp-16 


Chrominance 


8 


(Qp+13)/2 


Qp-6 



A non-linear scaler parameter (dc_scaler) for DC coefficients of DCT blocks, is expressed in terms of a piece-wise 
relationship with a quantization level, Qp. DC coefficients of luminance and chrominance blocks are quantized by cor- 
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responding non-linear scalers. For example, for a luminance block with a Qp range of 1 -4, dc_scaler=8. For a luminance 
block with a Qp value of 6, dc_scaler=2*Qp=12. Then, the forward quantization is performed using QDC = dc // 
dc_scaler, and the reconstructed DC values are computed using dc = dc_sca!er*QDC. 

Adaptive selection of the DC predictor coefficient is based on a comparison of horizontal and vertical QDC value 

5 gradients around the current block. For example, assume that block X 310 is an 8x8 block of DCT coefficients in an 
INTRA macroblock 300 which is currently being coded. Block X includes a DC coefficient 312. Block A 320 is an 8x8 
block which is the immediate left-hand neighbor of X, block C 340 is an 8x8 block which is the immediate top neighbor 
of block X, and block B 330 is an 8x8 block that is the immediate left-hand neighbor of block C. Block A includes a DC 
coefficient 322, block C includes a DC coefficient 342, and block B includes a DC coefficient 332. 

10 For linear quantization of DC coefficients, the QDC predictor value of block X, QDC' X , is obtained from either the 
QDC value of block A, QDC A , or the QDC value of block C, QDC C , based on a comparison of horizontal and vertical 
QDC gradients. In particular, if (|QDC A - QDC B | < |QDC B - QDC C ), then QDC' X =QDC C . Otherwise, QDC' X =QDC A . 

A differential DC value, DQDC X , is then obtained as follows: DQDC X = QDC X - QDC' X . The differential DC value 
is variable length coded and transmitted in a bitstream to a decoder. The process is independently repeated for every 

15 block of a macroblock, and then for each macroblock in a VOP or frame. At the decoder, the full DC coefficient is 
obtained from the operation QDC X = DQDC X + QDC' X . 

If any of the blocks A, B, or C are outside of a VOP or frame boundary, or they do not belong to an INTRA coded 
macroblock, their QDC prediction values are assumed to take a value of 2^ lt8 -P Gr - pixel - 1 ). For example, with 
bits _per _pixel=8, a value of 2 7 =128 is used. TTie DC prediction block is selected in a similar manner for the luminance 

20 and each of the two chrominance components of a block. 

The above scheme for linear quantization is modified for non-linear quantization of DC coefficients when dc_scaler 
is different for blocks A 320, B 330 and C 340. That is, dc_scaler is not the same for each of the three blocks. This may 
occur since the quantization level can change from macroblock to macroblock. A solution is to use the following rule for 
DC prediction. 

25 

if (\QDC< A l> -QDC ( B X) \<\QDC ( g l) -QDC ( c l '\) 
ODC x . = QDC™ 

30 & 

ODC x . = QDC A U . 
where 

dc x I ldc_scalar x> 

(ODC A * dc_scalar A ) I ldc_scalar x .and QDC A = dc A I ldc_scalar A , 
(ODC B * dc_scalar B ) I /dc_scalar x , and QDC B = dc g I ldc_scalar B , 
(QDC C * dc_scalar c )l ldc_scalar x ,and QDC C = dc c I fdc_scalar c . 

Essentially, the QDC values are scaled according to the quantization levels of the macroblock in which the DC coef- 
45 f icierrt resides and the macroblock in which the selected prediction block resides. Generally, dc_scaler is assigned on a 
MB by MB basis. The parameters QDC and Qp can be transmitted to a decoder in a digital bitstream. Qp is differentially 
encoded from MB to MB and transmitted as the MPEG-4 parameter DQUANT. At the decoder, Qp is obtained from 
DQUANT, and dc_scaler is obtained from Qp according to Table 1 . The QDC predictor is determined by re-calculating 
the horizontal and vertical gradients and considering the other predictor selection criteria, and, finally, the scaled QDC 
so value, QDC* 1 ), is obtained using the non-scaled QDC value and the dc_scaJer values. 

FIGs 4(a)-4(d) illustrate four possible arrangements of a current block and potential predictor blocks in accordance 
with the present invention. Selection of a DC coefficient predictor is not affected by the relative position of the current 
block X in the macroblock. 

FIG. 4(a) shows a first arrangement which corresponds to the arrangement of FIG. 3. Specifically, block X 310, 
55 which is the block currently being coded, is the upper left-hand block of the macroblock 300, block A 320 is the upper 
right-hand block of a macroblock 420, block B is the lower right-hand block of a macroblock 430, and block C 340 is the 
lower left-hand block of a macroblock 440. 

FIG. 4(b) shows a second arrangement, where the block X 31 0 is the upper right-hand block of the macroblock 300, 



ODC x . = 
ODC A {) = 
QDC B l) = 
QDC™ = 
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block A 320 is the upper left-hand block of the macroblock 300, bloc/c B 330 Is the lower left-hand block of the macrob- 

lock 440, and bloc/c C is the lower right-hand block of the macroblock 440. 

Generally, all blocks in the same macroblock are either all frame mode, all field mode without reordering, or all field 

mode with reordering. Additionally, all blocks in the same macroblock are either INTRA mode or INTER mode. Moreo- 
s ver, blocks in adjacent macroblocks may have different modes. Thus, in FIG. 4(b), block A 320 and block X 310 are 

either both frame mode, both field mode without reordering, or both field mode with reordering. Similarly, block B 330 

and block C 340 are either both frame mode, both field mode without reordering, or both field mode with reordering. 
FIG. 4(c) shows a third arrangement, where the block X 310 is the lower left-hand block of the macroblock 300, 

block A 320 is the lower right-hand block of the macroblock 420, block B is the upper right-hand block of the macroblock 
10 420, and block C is the upper left-hand block of the macroblock 420. Block A 320 and block B 330 are either both frame 

mode, both field mode without reordering, or both field mode with reordering. Similarly, block X 310 and block C 340 

are either both frame mode, both field mode without reordering, or both field mode with reordering. 

FIG. 4(d) shows a fourth arrangement, where the bloc/c X 31 0 is the lower right-hand block, block A 320 is the lower 

left-hand block, block B 330 is the upper left-hand block, and block C 340 is the upper right-hand block, each in the mac- 
is roblock 300. Since each of the blocks A, B, C and D are in the same macroblock, they are ail frame mode, all field mode 

without reordering, or all field mode with reordering. 

FIG. 5 illustrates AC coefficient prediction for an INTRA coded block in accordance with the present invention. With 

AC prediction, either the left-hand neighboring block or the top neighboring block is selected for use in differentially 

encoding the AC DCT coefficients of a current block. Those skilled in the art will recognize that all coefficients in a DCT 
20 block are AC coefficients, except the upper left-hand coefficient, which is a DC coefficient. Thus, for an 8x8 block, there 

are sixty-three AC coefficients and one DC coefficient. "DC" refers to direct current, which describes a steady state 

coefficient energy, while "AC refers to alternating current, which describes a coefficient energy which varies relative to 

the DC level. 

Typically, the distribution of energy in the AC coefficients is such that many coefficients near the lower right-hand 
25 portion of a block have a zero or near-zero value, and therefore need not be differentially encoded, while the coefficients 
near the upper left-hand portion of a block are differentially encoded. 

No macroblock boundary is shown in FIG. 5 since the blocks may assume one of the four arrangements as shown 
in FIGs 4(a)-(d). Additionally, while each block includes sixty-four coefficients, not all are individually shown. Block X 
310 includes a first row of AC coefficients 314 and a first column of AC coefficients 316, block C includes a first row of 
30 AC coefficients 344, and block A includes a first column of AC coefficients 324. 

Either DC coefficient 322 in block A 320 or DC coefficient 342 in block C 340 will be selected as the predictor for 
the DC coefficient 31 2 in block X 3 1 0. Moreover, AC coefficients from either the first row or the first column of a previous 
coded block (i.e., within the same VOP) are used to predict the corresponding coefficients of the current block. 

For each block, the selected direction for DC coefficient prediction is also used to select the direction for AC coef- 
35 f icient prediction when the current block and the predictor block have the same dctjype. That is, when the current block 
and the predictor block both have a dct_type=0 (i.e. , non-reordered field mode, or frame mode) or dct_type=1 (i.e. , reor- 
dered field mode), the AC coefficients of the DC predictor block are used to differentially encode the AC coefficients of 
the current block. The MPEG-4 flag "dctjype" was discussed above in connection with FIG. 2. 

in a first example, assume the current block, bloc/c X 310, is arranged as shown in FIG. 4(a). If blocks X, A and C 
40 have the same dctjype, then the AC coefficients of the DC predictor block are used to differentially encode the AC 
coefficients of the current block. 

For example, if block C 340 is the DC predictor Nock, then the first row of AC coefficients 344 of block C is used to 
predict the first row of the AC coefficients 314 of block X on a coefficient-by-coefficient basis. That is, the first AC coef- 
ficient of row 344 is used to predict the first AC coefficient of row 314 as indicated by arrow 346, the second AC coeffi- 
45 cient of row 344 is used to predict the second AC coefficient of row 314, and so forth. Similarly, the second row of AC 
coefficients of block C 340 is used to predict the second row of the AC coefficients of block X 310, and so forth, until 
the last row of AC coefficients of block C is used to predict the last row of the AC coefficients of block X. Note that the 
first row of each block has seven AC coefficients, while the remaining rows have eight AC coefficients. 

Alternatively, still assuming blocks X, A and C all have the same dctjype, if block A is the DC predictor block, then 
so the first column of AC coefficients 324 of block C is used to predict the first column 31 6 of the AC coefficients of block 
X on a coefficient-by-coefficient basis. For example, the first AC coefficient of column 324 is used to predict the first AC 
coefficient of column 316 as indicated by arrow 326. Similarly, the second column of AC coefficients of block A is used 
to predict the second column of the AC coefficients of X and so forth, until the last column of AC coefficients of block A 
is used to predict the last column of the AC coefficients of block X. Note that the first column of each block has seven 
55 AC coefficients, while the remaining columns have eight AC coefficients. 

Still assuming the arrangement of FIG 4(a) is used, if block X and block C have different dctjypes, then block A is 
selected as the AC predictor block, regardless of the DC predictor block. Block A is selected as the AC predictor in this 
case even if block A and block X have different dctjypes, since the reordering of lines in a reordered field mode block 
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does not significantly affect the distribution of horizontal spatial frequencies relative to a non-reordered field mode block 
or a frame mode block. Accordingly, an improved coding efficiency can still be expected by using block A as an AC pre- 
dictor block. 

If block X and block C have the same dctjype, but block X and block A have different dctjypes, then block C is 
s selected as the AC predictor block, regardless of the DC predictor block. 

Thus, it is possible for the DC predictor to come from block C, for example, while the AC predictors are from block 
A. Alternatively, it is possible for the DC predictor to come from block A, while the AC predictors are from block C. 

Generally, the reason for avoiding block C as an AC predictor block when block C and block X have different 
dctjypes is that reordering of the pixel lines changes the vertical distribution of spatial energy in a block, so the corre- 
10 lation between the spatial frequencies between two blocks with different dctjypes is significantly reduced. 

For example, assume block X is an image of a fast-moving object with a vertical boundary which moves horizontally 
across the block In this situation, block X may be coded as a reordered field mode block to avoid a zippering or zigzag 
effect of the vertical boundary. Such zippering, which is caused by a temporal delay between the top and bottom fields, 
is manifested as a vertical zigzag line in the block. However, if the vertical boundary of the moving image did not extend 
15 into block C, block C may be coded in non-reordered field mode, or in frame mode. In this case, block C cannot provide 
good AC predictor coefficients for block X due to poor correlation of the AC coefficients, so block A is used as the AC 
predictor block. Block A will generally provide a higher correlation. 

In a second example, assume the current block is arranged as shown in FIG. 4(b). Here, blocks A and X have the 
same dctjype as one another, and blocks B and C have the same dctjype as one another. If blocks X and C also 
20 have the same dctjype as one another, then the AC coefficients of the DC predictor block are used to differentially 
encode the AC coefficients of the current block. If blocks X and C have different dctjypes, then block A is used as the 
AC predictor block, regardless of the DC predictor block. 

In a third example, assume the current block is arranged as shown in FIG. 4(c). Blocks X and C have the same 
dctjype as one another, and blocks A and B have the same dctjype as one another. If blocks X and A also have the 
25 same dctjype as one another, then the AC coefficients of the DC predictor block are used to differentially encode the 
AC coefficients of the current block. If blocks X and A have different dctjypes, then block C is used as the AC predictor 
block, regardless of the DC predictor block. 

In a fourth example, assume the current block is arranged as shown in FIG. 4(d). Blocks A, B, C and X all have the 
same dctjype since they are in the same macroblock 300. Thus, the AC coefficients of the DC predictor block are used 
30 to differentially encode the AC coefficients of the current block. 

As a general rule, block C is used as the AC predictor block only when block X and block C have the same 
dctjype, and the DC predictor is from block C. Otherwise, block A is used as the AC predictor block. Moreover, AC pre- 
dictions are performed similarly for the luminance and each of the two chrominance components. 

Note that when block A would be selected by default (i.e., when block X and block C have different dctjypes), but 
35 there is no block A in reality, such as when block X is in the left half of the first macroblock in a row, zero is used for the 
AC predictors. 

In the examples of FIGs 4 and 5, each block is assumed to be INTRA coded, and within a common VOP boundary. 
However, if any of the blocks A, B or C are outside of the VOP boundary in which block X is contained, or do not belong 
to an INTRA coded macroblock, their quantized AC (QAC) values are assumed to take a value of zero for the purpose 
40 of computing prediction values. The QDC values are set to a non-zero constant, as discussed. 

Furthermore, to compensate for differences in the quantization of previous horizontally adjacent or vertically adja- 
cent blocks used in AC prediction of the current block, scaling of prediction coefficients is required. The prediction is 
modified so that the predictor is scaled by the ratio of the current quantization step size and the quantization step size 
of the predictor block. 

45 In particular, if block A was selected as the AC predictor for the current block (e.g., block X), the scaled horizontal 
AC prediction is: 

nAr QAC i0A xQP A 

where QAC i0 A is the non-scaled quantized AC value for the (i,0)th coefficient, QP A is the quantization parameter for 
block A, and QP X is the quantization parameter for block X. 

If block C was selected as the AC predictor, the scaled vertical AC prediction is: 

UAL ojx gp~ 
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where QAC j0C is the non-scaled quantized AC value for the (Oj)th coefficient, QP C is the quantization parameter for 
block C, and QP X is the quantization parameter for block X. The pair (i,j) denotes a particular coefficient in a block by 
the horizontal V position, and the vertical T position. For example, (i j)=(0,0) denotes the upper left-hand coefficient in 
a block, and (i,j)=(8,8) denotes the lower right-hand coefficient in a block. 

s If block A or block C are outside of the VOP in which block X resides, then the corresponding QP values are 
assumed to be equal to QP X . 

Although differential encoding of DCT coefficients in an INTRA block as discussed in connection with FIQs 3-5, in 
particular, can usually improve coding efficiency by reducing the amount of data needed to represent the block, this is 
not always the case. Thus, when AC coefficient prediction results in a larger magnitude error signal as compared to the 

10 original signal, it is desirable to disable AC prediction. However, to avoid excessive overhead data, AC prediction can 
be switched on or off on a macroWock-by-macroblock basis rather than on a block-by-block basis. 

The decision for switching AC prediction on or off is based on a comparison of the sum of the absolute values of all 
AC coefficients to be predicted in a macroblock, and the sum of the absolute values of the predicted differences. In par- 
ticular, if block A is selected as the AC predictor block, a criterion S is calculated, as follows: 



15 



20 
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If block C is selected as the DC predictor for the current block, the criterion S is calculated, as follows: 

r 7 



25 V 
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Next, for all blocks in the macroblock for which a common decision is to be made, a single IS is calculated and a flag, 
such as the MPEG-4 flag "ACpredJIag" is either set/reset to enable/disable AC prediction. In particular, if (IS £ 0), set 
30 ACpredJlag=1 to enable AC prediction. Otherwise, set ACpredJlag=0 to disable AC Prediction. DC prediction is still 
enabled in either case. ACpredJIag is transmitted to the decoder for use in recovering the DCT coefficients of each 
block 

FIG. 6 is a block diagram of a decoder in accordance with the present invention. The decoder, shown generally at 
600, can be used to receive and decode the encoded data signals transmitted from the encoder of FIG. 1 . The encoded 

35 video image data and differentially encoded motion vector (MV) data are received at terminal 640 and provided to a 
demultiplexer (DEMUX) 642. For INTER macroblocks, the encoded video image data is differentially encoded in DCT 
transform coefficients as a prediction error signal (e.g., residue). For INTRA macroblocks, the video image itself is not 
differentially encoded, but the DCT coefficients may be differentially encoded using neighboring transform coefficients 
in the same VOP, i.e., according to ACpredJIag. 

40 A shape decoding function 644 processes the data when the VOP has an arbitrary shape to recover shape infor- 
mation, which is, in turn, provided to a motion compensation function 650 and a VOP reconstruction function 652. A tex- 
ture decoding function 646 performs an inverse DCT on transform coefficients to recover residue information for INTER 
coded macroblocks. 

For INTRA coded macroblocks (MBs), pixel information is recovered directly and provided to the VOP reconstruc- 
ts tion function 652. In particular, when ACpredJ lag=0, the inverse DCT at the texture decoding function 646 recovers the 
AC coefficients of the current block directly since the AC coefficients are not differentially encoded. The DC coefficient 
is differentially encoded using the DC coefficient (e.g., predictor) of either the top or left-hand block. The DC predictor 
may be zeroed when the selected predictor block is outside the current VOP, or is from an INTER coded block. The full 
DC coefficient is recovered by summing the differentially encoded DC coefficient of the current block with the predictor 
so DC coefficient, i.e., according to the operation QDC X - DQDC X + QDC' X . 

For INTRA coded macroblocks with ACpredJ lag=1 , both the AC and DC coefficients of the current block are differ- 
entially encoded. The DC coefficient is recovered as discussed above for the case when ACpred Jlag=0. The full AC 
coefficients are recovered by performing the inverse DCT to obtain the differentially encoded AC coefficients of the cur- 
rent block, and summing the differentially encoded AC coefficients with the corresponding AC coefficients of the predic- 
55 tor block, i.e., according to the operation QAC X = DQAC X + QAC' X . The AC predictor may be zeroed when the selected 
predictor block is outside the current VOP or is in an INTER coded block 

Thus, for INTRA coded blocks, it is necessary for the decoder to identify the proper predictor block separately for 
the AC and DC coefficients. This can be achieved by providing a codeword or codewords in the bitstream for each mac- 
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roblock which indicates the predictor block. For example, a codeword of "00" may indicate that the top block is the pre- 
dictor for both AC and DC coefficients, a codeword of "01" may indicate that the left-hand block is the predictor for both 
AC and DC coefficients, a codeword of "10" may indicate that the left-hand block is the predictor for the DC coefficient 
while the top block is the predictor for the AC coefficients, and a codeword of "1 1 " may indicate that the left-hand block 
s is the predictor for the AC coefficient while the top block is the predictor for the DC coefficients. 

Additionally, other bits or codewords or the like may be used to indicate that a zeroed predictor is to be used for 
either the DC or AC coefficients. Or, the decoder may independently check to see if a selected predictor block is INTER 
coded or outside the current VOP, and set the predictor coefficients to zero or to a non-zero constant as required. 
Alternatively, the texture decoding function 646 at the decoder 600 may independently perform the selection algo- 

10 rithm discussed above in connection with FIGs 3-5 to determine the correct predictor coefficients. In this case, the tex- 
ture decoding function 646 may have a memory for storing the decoded DCT coefficients of the left, top, and top, left 
blocks for use in processing the current block. The flag dctjype must also be available for each macroblock in this case. 
Appropriate circuitry can be implemented in software, firmware, or hardware to identify the desired predictor coefficients 
in order to decode the current block. 

15 For INTER coded blocks and MBs, such as those in B-VOPs, the pixel information provided from the texture decod- 
ing function 646 to the reconstructed VOP function 652 represents a residue between the current MB and a reference 
image. The reference image may be pixel data from a single anchor MB which is indicated by a forward or backward 
MV. Alternatively, for an interpolated MB, the reference image is an average of pixel data from two reference MBs, e.g., 
one past anchor MB and one future anchor MB. In this case, the decoder must calculate the averaged pixel data accord- 
so ing to the forward and backward MVs before recovering the current MB pixel data. 

For INTER coded blocks and MBs, a motion decoding function 648 processes the encoded MV data to recover the 
differential MVs and provide them to the motion compensation function 650 and to a motion vector memory 649, such 
as a RAM. The motion compensation function 650 receives the differential MV data and determines a PMV according 
to the coding mode and pmv[ ] index of the current MB and the previous MB, in bitstream order. 

25 Once the motion compensation function 650 determines a full reference MV and sums it with the differential MV of 
the current MB, the full MV of the current MB is available. Accordingly, the motion compensation function 650 can now 
retrieve anchor frame best match data from a VOP memory 654, such as a RAM, calculate an averaged image if 
required, and provide the anchor frame pixel data to the VOP reconstruction function to reconstruct the current MB. 
The retrieved or calculated best match data is added back to the pixel residue at the VOP reconstruction function 

30 652 to obtain the decoded current MB or block. The reconstructed block is output as a video output signal and also pro- 
vided to the VOP memory 654 to provide new anchor frame data. 

FIG. 7 illustrates a macroblock packet structure in accordance with the present invention. The structure indicates 
the format of data received by the decoder. Note that the packets are shown in four rows for convenience only. The 
packets are actually transmitted serially, starting from the top row, and from left to right within a row. A first row 710 

35 includes fields f irst_shape_code, MVD_shape, CR, ST and BAC. A second row 730 includes fields COD and MCBPC. 
A third row 750 includes fields ACpredJIag 753, CBPY, DQUANT, Interlacedjnformation, MVD, MVD2, MVD3 and 
MVD4. A fourth row 770 includes fields CODA, Alpha_ACpred_f lag, CBPA, Alpha Block Data and Block Data. Each of 
the above fields is defined according to MPEG-4 VM 8.0. 

The term first_shape_code indicates whether a MB is in a bounding box of a VOP CR indicates a conversion ratio 

40 for Binary Alpha Blocks. ST indicates a horizontal or vertical scan order. BAC refers to a binary arithmetic codeword. 

COD and CODA refer to gray scale shape coding. MCBPC refers to a macroblock type and coded block pattern for 
chrominance. As discussed, ACpredJIag 753 is a one bit flag which, when set to zero, indicates that only DC prediction 
is performed for a current INTRA block ACpred Jlag=1 indicates that AC and DC prediction are performed for a current 
INTRA block. CBPY refers to a coded block pattern for luminance. DQUANT defines changes in the value of a quan- 

45 tizer, Qp, from macroblock to macroblock. 

The field Interlacedjnformation in the third row 750 indicates whether a macroblock is interlaced coded. The 
Interlacedjnformation field may be stored for subsequent use as required in the motion vector memory 1349 or other 
memory in the decoder. The Interlacedjnformation field may also include the flag dctjype which, as discussed, indi- 
cates whether top and bottom field pixel lines in a field coded macroblock are reordered. 

so MVD, MVD2, MVD3 and MVD4 refer to motion vector data. Alpha_ACpredJlag refers to an ACpredJIag for alpha 
shape coding. Alpha Block Data refers to binary and gray scale shape information known as alpha planes. 

The arrangement shown in FIG. 7 is an example only. Various other arrangements for communicating the relevant 
information to the decoder will become apparent to those skilled in the art. 

A bitstream syntax and MB packet syntax for use in accordance with the present invention is described in MPEG- 

55 4 VM 8.0 as well as commonly assigned U.S. Patent application Serial No. 08/897,847 to Eifrig et al., filed July 21 , 1997, 
entitled "Motion Estimation and Compensation of Video Object Planes for Interlaced Digital Video", incorporated herein 
by reference. 

Accordingly, it can be seen that the present invention provides a scheme for efficiently coding DC and AC DCT 
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transform coefficients of an INTRA coded block. The DC and AC coefficients are differentially encoded by selecting pre- 
dictor DC and AC coefficients from a left-hand neighboring block and a top neighboring block. Each block is coded 
according to a frame mode, a reordered field mode, or a non-reordered field mode. The AC predictor block is selected 
according to the respective coding modes of the blocks, and the block in which the DC predictor resides. The top block 
5 is selected as an AC predictor when the top block and current block are both reordered field mode, or both frame mode 
and/or non-reordered field mode, and the DC predictor resides in the top block. Zeroed AC spatial transform coefficients 
are used in place of the AC spatial transform coefficients from the selected block when the selected block is not INTRA 
coded, or does not reside in the same Video Object Plane (VOP) as the current block. DC coefficients can similarly be 
set to a non-zero value in this case. 

10 In particular, the present invention improves coding efficiency over previous schemes which do not provide the top 
block as a candidate predictor block, or account for whether the current and candidate blocks are coded in a frame 
mode, non-reordered field mode, or reordered field mode. Correlation of spatial frequencies between the current block 
and the top block can result in improved coding efficiency. Additionally, the scheme encompasses the MPEG-4 standard 
by providing for zeroing of predictor coefficients when the candidate predictor is outside a current video object plane 

15 (VOP), is not INTRA coded, or is otherwise not available. 

Additionally, the scheme is compatible with both linear and non-linear quantization of the DCT coefficients. In par- 
ticular, non-linear quantization of DC coefficients is disclosed, where the coded coefficient is scaled at a decoder 
according to the quantization levels of the macroblock in which the coefficient resides and of the macroblock in which 
the selected prediction block resides to recover the scaled coefficient. 

20 Although the invention has been described in connection with various specific embodiments, those skilled in the art 
will appreciate that numerous adaptations and modifications may be made thereto without departing from the spirit and 
scope of the invention as set forth in the claims. For example, while the invention was discussed in connection with DCT 
transform coefficients, the invention may be adapted for use with coefficients of other spatial transforms. 

25 Claims 

1. A method for coding spatial transform coefficients of a current, INTRA coded block in a digital video image, com- 
prising the steps of: 

30 identifying an adjacent left-hand block, an adjacent top block, and a top, left block which is adjacent to said left- 

hand block and said top block; 

said left-hand block, top block, and top, left block each having a DC spatial transform coefficient and a plurality 
of AC spatial transform coefficients; 

each of said blocks being coded in a coding mode according to one of a frame mode, a reordered field mode, 
35 and a non-reordered field mode; and 

selecting AC spatial transform coefficients from one of said left-hand block and said top block for use in differ- 
entially encoding respective AC spatial transform coefficients of said current block; 
said selecting step being responsive to the coding mode of said current block and said top block. 

40 2. The method of claim 1 , comprising the further step of: 

selecting the AC spatial transform coefficients from said left-hand block when said top block is coded according 
to one of the frame mode and the non-reordered field mode, and said current block is coded according to the 
reordered field mode. 

45 

3. The method of claim 1 or 2, comprising the further step of: 

selecting the AC spatial transform coefficients from said left-hand block when said current block is coded 
according to one of the frame mode and the non-reordered field mode, and said top block is coded according 
so to the reordered field mode. 

4. The method of one of the preceding claims, comprising the further step of: 

selecting a DC spatial transform coefficient for use in differentially encoding the DC spatial transform coeffi- 
55 cient of said current block from one of said left-hand block and said top block according to (a) a gradient 

between the DC spatial transform coefficients of said left block and said top, left block, and (b) a gradient 
between the DC spatial transform coefficients of said top block and said top, left block. 
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5. The method of claim 4, comprising the further step of: 

selecting AC spatial transform coefficients for use in the differential encoding from one of said left-hand block 
and said top block according to the selected DC spatial transform coefficient. 

5 

6. The method of claim 4, comprising the further step of: 

selecting AC spatial transform coefficients for use in the differential encoding from said top block when the DC 
spatial transform coefficient is selected from said top block, and said current block is coded according to the 
10 frame mode or the non-reordered field mode, and said top block is coded according to the frame mode or the 

non-reordered field mode. 

7. The method of claim 4, comprising the further step of: 

is selecting AC spatial transform coefficients for use in the differential encoding from said top block when the DC 

spatial transform coefficient is selected from said top block, and said current block and said top block are both 
coded according to the reordered field mode. 

8. The method of one of the preceding claims, comprising the further step of: 

20 

using zeroed AC spatial transform coefficients in place of the AC spatial transform coefficients from the 
selected block when the selected block is not INTRA coded. 

9. The method of one of the preceding claims, wherein said current block resides in a current video object plane 
25 (VOP), comprising the further step of: 

using zeroed AC spatial transform coefficients in place of the AC spatial transform coefficients from the 
selected block when the selected block does not reside in said current VOP. 

30 1 0. The method of one of claims 4 to 7, comprising the further step of: 

non-linearly quantizing the DC spatial transform coefficient of the current block. 

1 1 . A method for decoding differentially encoded spatial transform coefficients of a current, INTRA coded block in a dig- 
35 ital video image which is transmitted in a digital bitstream to recover corresponding full AC spatial transform coeffi- 
cients of said current block, comprising the steps of: 

recovering said digital bitstream; 

said digital bitstream comprising data from an adjacent left-hand block, an adjacent top block, and a top, left 
40 block which is adjacent to said left-hand block and said top block; 

said left-hand block, top block, and top, left block each having a DC spatial transform coefficient and a plurality 
of AC spatial transform coefficients; 

each of said blocks being coded in a coding mode according to one of a frame mode, a reordered field mode, 
and a non-reordered field mode; 
45 selecting AC spatial transform coefficients from one of said left-hand block and said top block; and 

summing said selected AC spatial transform coefficients with said corresponding differentially encoded AC 
spatial transform coefficients of said current block to recover said full AC spatial transform coefficients of said 
current block; 

said selecting step being responsive to the coding mode of said current block and said top block. 

50 

12. The method of claim 1 1 , comprising the further step of: 

selecting AC spatial transform coefficients for said summing step from said left-hand block when said top block 
is coded according to one of the frame mode and the non-reordered field mode, and said current block is coded 
55 according to the reordered field mode. 

13. The method of claim 1 1 or 12, comprising the further step of: 
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selecting AC spatial transform coefficients for said summing step from said left-hand block when said current 
block is coded according to one of the frame mode and the non-reordered field mode, and said top block is 
coded according to the reordered field mode. 

14. The method of one of claims 1 1 to 13, wherein the DC spatial transform coefficient of said current block is differen- 
tially encoded prior to transmission in the digital bitstream using the DC spatial transform coefficient from one of 
said left-hand block and said top block according to (a) a first gradient between the DC spatial transform coefficients 
of said left block and said top, left block, and (b) a second gradient between the DC spatial transform coefficients 
of said top block and said top, left block, comprising the further steps of: 

determining said first and second gradients; and 

selecting the DC spatial transform coefficient which was used in differentially encoding the DC spatial trans- 
form coefficient of said current block from one of said left-hand block and said top block according to the deter- 
mined gradients. 

15. The method of claim 14, comprising the further step of: 

selecting AC spatial transform coefficients for said summing step from one of said left-hand block and said top 
block according to the selected DC spatial transform coefficient. 

16. The method of claim 14, comprising the further step of: 

selecting AC spatial transform coefficients for said summing step from said top block when the DC spatial 
transform coefficient is selected from said top block, and said current block is coded according to the frame 
mode or the non-reordered field mode, and said top block is coded according to the frame mode or the non- 
reordered field mode. 

17. The method of claim 14, comprising the further step of: 

selecting AC spatial transform coefficients for said summing step from said top block when the DC spatial 
transform coefficient is selected from said top block, and said current block and said top block are both coded 
according to the reordered field mode. 

18. TTie method of one of claims 1 1 to 17, comprising the further step of: 

using zeroed AC spatial transform coefficients in place of the AC spatial transform coefficients from the 
selected block when the selected block is not INTRA coded. 

19. The method of one of claims 11 to 18, wherein said current block resides in a current video object plane (VOP), 
comprising the further step of: 

using zeroed AC spatial transform coefficients in place of the AC spatial transform coefficients from the 
selected block when the selected block does not reside in said current VOP 

20. The method of one of claims 1 1 to 19, wherein: 

said selecting step is responsive to a codeword provided in said digital bitstream which designates the AC spa- 
tial transform coefficients from one of said left-hand block and said top block. 

21 . The method of claim 1 4, wherein the DC spatial transform coefficient of the current block is non-linearly quantized, 
comprising the further steps of: 

recovering data from said digital bitstream indicative of the respective quantization levels of the selected block 
and the current block; and 

scaling the DC spatial transform coefficient of the current block according to the respective quantization levels 
of the selected block and the current block to recover a scaled DC spatial transform coefficient. 

22. An apparatus for coding spatial transform coefficients of a current, INTRA coded block in a digital video image, 
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comprising: 

means for identifying an adjacent left-hand block, an adjacent top block, and a top, left block which is adjacent 
to said (eft-hand block and said top block; 

said left-hand block, top block, and top, left block each having a DC spatial transform coefficient and a plurality 
of AC spatial transform coefficients; 

each of said blocks being coded in a coding mode according to one of a frame mode, a reordered field mode, 
and a non-reordered field mode; and 

means for selecting AC spatial transform coefficients from one of said left-hand block and said top block for use 

in differentially encoding respective AC spatial transform coefficients of said current block; 

said means for selecting being responsive to the coding mode of said current block and said top block. 

23. The apparatus of claim 22, wherein: 

said means for selecting selects the AC spatial transform coefficients from said left-hand block when said top 
block is coded according to one of the frame mode and the non-reordered field mode, and said current block 
is coded according to the reordered field mode. 

24. The apparatus of claim 22 or 23, wherein: 

said means for selecting selects the AC spatial transform coefficients from said left-hand block when said cur- 
rent block is coded according to one of the frame mode and the non-reordered field mode, and said top block 
is coded according to the reordered field mode. 

25. The apparatus of one of claims 22 to 24, wherein: 

said means for selecting selects a DC spatial transform coefficient for use in differentially encoding the DC spa- 
tial transform coefficient of said current block from one of said left-hand block and said top block according to 
(a) a gradient between the DC spatial transform coefficients of said left block and said top, left block, and (b) a 
gradient between the DC spatial transform coefficients of said top block and said top, left block. 

26. The apparatus of claim 25, wherein: 

said means for selecting selects AC spatial transform coefficients for use in the differential encoding from one 
of said left-hand block and said top block according to the selected DC spatial transform coefficient. 

27. The apparatus of claim 25, wherein: 

said means for selecting selects AC spatial transform coefficients for use in the differential encoding from said 
top block when the DC spatial transform coefficient is selected from said top block, and said current block is 
coded according to the frame mode or the non-reordered field mode, and said top block is coded according to 
the frame mode or the non-reordered field mode. 

28. The apparatus of claim 25, wherein: 

said means for selecting selects AC spatial transform coefficients for use in the differential encoding from said 
top block when the DC spatial transform coefficient is selected from said top block, and said current block and 
said top block are both coded according to the reordered field mode. 

29. The apparatus of one of claims 22 to 28, further comprising: 

means for using zeroed AC spatial transform coefficients in place of the AC spatial transform coefficients from 
the selected block when the selected block is not INTRA coded. 

30. The apparatus of one of claims 22 to 29, wherein said current block resides in a current video object plane (VOP), 
further comprising: 

means for using zeroed AC spatial transform coefficients in place of the AC spatial transform coefficients from 
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the selected block when the selected block does not reside in said current VOP. 

31 . The apparatus of one of claims 25 to 28, further comprising: 

5 means for non-linearly quantizing the DC spatial transform coefficient of the current block. 

32. An apparatus for decoding differentially encoded spatial transform coefficients of a current, INTRA coded block in 
a digital video image which is transmitted in a digital bitstream to recover corresponding full AC spatial transform 
coefficients of said current block, comprising: 

10 

means for recovering said digital bitstream; 

said digital bitstream comprising data from an adjacent left-hand block, an adjacent top block, and a top, left 
block which is adjacent to said left-hand block and said top block; 

said left-hand block, top block, and top, left block each having a DC spatial transform coefficient and a plurality 
is of AC spatial transform coefficients; 

each of said blocks being coded in a coding mode according to one of a frame mode, a reordered field mode, 
and a non-reordered field mode; 

first means for selecting AC spatial transform coefficients from one of said left-hand block and said top block; 
and 

20 means for summing said selected AC spatial transform coefficients with said corresponding differentially 

encoded AC spatial transform coefficients of said current block to recover said full AC spatial transform coeffi- 
cients of said current block; 

said first means for selecting being responsive to the coding mode of said current block and said top block. 

25 33. The apparatus of claim 32, wherein: 

said first means for selecting selects AC spatial transform coefficients for use by the summing means from said 
left-hand block when said top block is coded according to one of the frame mode and the non-reordered field 
mode, and said current block is coded according to the reordered field mode. 

30 

34. The apparatus of claim 32 or 33, wherein: 

said first means for selecting selects AC spatial transform coefficients for use by the summing means from said 
left-hand block when said current block is coded according to one of the frame mode and the non-reordered 
35 field mode, and said top block is coded according to the reordered field mode. 

35. The apparatus of one of claims 32 to 34, wherein the DC spatial transform coefficient of said current block is differ- 
entially encoded prior to transmission in the digital bitstream using the DC spatial transform coefficient from one of 
said left-hand block and said top block according to (a) a first gradient between the DC spatial transform coefficients 

40 of said left block and said top, left block, and (b) a second gradient between the DC spatial transform coefficients 
of said top block and said top, left block, further comprising: 

means for determining said first and second gradients; and 

second means for selecting the DC spatial transform coefficient which was used in differentially encoding the 
45 DC spatial transform coefficient of said current block from one of said left-hand block and said top block accord- 

ing to the determined gradients. 

36. The apparatus of claim 35, wherein: 

so said first means for selecting selects AC spatial transform coefficients for use by the summing means from one 

of said left-hand block and said top block according to the selected DC spatial transform coefficient. 

37. The apparatus of claim 35, wherein: 

55 said first means for selecting selects AC spatial transform coefficients for use by the summing means from said 

top block when the DC spatial transform coefficient is selected from said top block, and said current block is 
coded according to the frame mode or the non-reordered field mode, and said top block is coded according to 
the frame mode or the non-reordered field mode. 
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38. The apparatus of claim 35, wherein: 

said first means for selecting selects AC spatial transform coefficients for use by the summing means from said 
top block when the DC spatial transform coefficient is selected from said top block, and said current block and 
5 said top block are both coded according to the reordered field mode. 

39. The apparatus of one of claims 32 to 38, further comprising: 

means for using zeroed AC spatial transform coefficients in place of the AC spatial transform coefficients from 
10 the selected block when the selected block is not INTRA coded. 

40. The apparatus of one of claims 32 to 39, wherein said current block resides in a current video object plane (VOP), 
further comprising: 

75 means for using zeroed AC spatial transform coefficients in place of the AC spatial transform coefficients from 

the selected block when the selected block does not reside in said current VOP. 

41 . The apparatus of one of claims 32 to 40, wherein: 

20 said first means for selecting is responsive to a codeword provided in said digital bitstream which designates 

the AC spatial transform coefficients from one of said left-hand block and said top block. 

42. The apparatus of claim 35, wherein the DC spatial transform coefficient of the current block is non-linearly quan- 
tized, further comprising: 

25 

means for recovering data from said digital bitstream indicative of the respective quantization levels of the 
selected block and the current block; and 

means for scaling the DC spatial transform coefficient of the current block according to the respective quanti- 
zation levels of the selected block and the current block to recover a scaled DC spatial transform coefficient. 

30 
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